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The Translationally Controlled Tumor Protein (TCTP) is a highly conserved protein at the 
level of sequence, considered to play an essential role in the regulation of growth and 
development in eukaryotes. However, this function has been inferred from studies in a 
few model systems, such as mice and mammalian cell lines, Drosophila and Arabidopsis. 
Thus, the knowledge regarding this protein is far from complete. In the present study 
bioinformatic analysis showed the presence of one or more TCTP genes per genome 
in plants with highly conserved signatures and subtle variations at the level of primary 
structure but with more noticeable differences at the level of predicted three-dimensional 
structures. These structures show differences in the "pocket" region close to the center 
of the protein and in its flexible loop domain. In fact, all predictive TCTP structures can be 
divided into two groups: (1) AtTCTPI-like and (2) CmTCTP-like, based on the predicted 
structures of an Arabidopsis TCTP and a Cucurbita maxima TCTP; according to this 
classification we propose that their probable function in plants may be inferred in principle. 
Thus, different TCTP genes in a single organism may have different functions; additionally, 
in those species harboring a single TCTP gene this could carry multiple functions. On 
the other hand, in silico analysis of AtTCTPI-Wke and CmTCrP-like promoters suggest 
that these share common motifs but with different abundance, which may underscore 
differences in their gene expression patterns. Finally, the absence of TCTP genes in 
most chlorophytes with the exception of Coccomyxa subellipsoidea, indicates that other 
proteins perform the roles played by TCTP or the pathways regulated by TCTP occur 
through alternative routes. These findings provide insight into the evolution of this gene 
family in plants. 

Keywords: Translationally Controlled Tumor Protein, Arabidopsis ttialiana, evolution, phylogeny, GTP-binding 
pocket 



INTRODUCTION 

The Translationally Controlled Tumor Proteins, or TCTP, consti- 
tute a protein family found exclusively in eukaryotes, which are 
central for growth regulation (Bommer et al., 2002). Sequence 
comparison of TCTP sequences reveals a high degree of conser- 
vation among all eukaryotic phyla (Hinojosa-Moya et al., 2008). 
Such conservation, as well as its nearly ubiquitous expression 
underscores its essential role in the regulation of growth and 
development of different organisms (Thaw et al., 2001; Bommer 
and Thiele, 2004; Amson et al, 2013). 

Currently, work on TCTP has focused mostly on animals, par- 
ticularly models like human, mice and Drosophila, which has 
shown it to be involved in several biological processes such as cell 
growth, cell cycle progression, differentiation, malignant trans- 
formation, protection against various stress conditions and apop- 
tosis (Amson et al, 2013). Studies on TCTP function in plants 
have been conducted with few experimental systems, mostly 
Arabidopsis. Although the molecular function of TCTP in plants 
has not been completely established yet, it is likely that in broad 
terms its function is similar, e.g., growth and developmental 
regulation since such function appears to be conserved across 
kingdoms; indeed a Drosophila TCTP mutant can be rescued 



with the corresponding Arabidopsis TCTP gene, and vice versa 
(Brioudes et al., 2010). Extant data suggest that TCTP is consti- 
tutively expressed at high levels in most tissues in different plants 
species; there is also evidence that TCTP expression is affected by 
a variety of conditions (Bommer and Thiele, 2004; Nagano-Ito 
and Ichikawa, 2012; Amson et al., 2013). Indeed, TCTP mRNA 
levels vary considerably in response to a wide range of extracellu- 
lar stimuli and in multiple seemingly unrelated cellular processes 
(Bommer and Thiele, 2004). 

The first plant TCTP mRNA sequence was obtained in 
Medicago sativa (Pay et al., 1992). The notion that its expres- 
sion in plants correlates positively with growth was supported by 
the fact that TCTP mRNA accumulated in the root cap of Pisum 
sativum, a region undergoing continuous cell division (Woo and 
Hawes, 1997). Furthermore, possible functions related to pho- 
toperiodism and flowering in Pitarbitis nil have been suggested 
(Sage-Ono et al., 1998). TCTP is regulated in response to a wide 
range of stimuli, such as aluminum (Ermolayev et al., 2003), dam- 
age caused by Hg^+ (Wang et al., 2012) and NaCl (Vincent et al., 
2007; Cao et al, 2010), heat, cold, and drought (Kim et al, 2012; 
Li et al., 2013), as well as growth regulators such as auxins, ABA 
(Berkowitz et al, 2008; Cao et al, 2010; Kim et al, 2012), and 
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methyl jasmonate (Li et al, 2013). TCTP also seems to be involved 
in response to pathogens such as Pseudomonas syringae (lones 
et al., 2006) and Erysiphe graminis, the causal agent of powdery 
mildew in wheat (Li et al., 2010). 

On the other hand, TCTP has been detected in the phloem 
sap of Ricinus communis and Cucurbita maxima, suggesting its 
phloem long-distance movement. This in turn suggests that these 
proteins, at least in some species, act in a non-ceU-autonomous 
manner (Barnes et al, 2004; Aoki et al., 2005; Hinojosa-Moya 
et al, 2006). It is worth mentioning that lupin TCTP mRNA 
is present in the phloem sap transcriptome, and the pumpkin 
TCTP mRNA has been found in phloem sap exudates (Rodriguez- 
Medina et al, 2011; Hinojosa-Moya et al, 2013). 

High expression levels of this gene have also been detected in 
embryo and endosperm during embryo development in Jatropha 
curcas and Ricinus communis, indicating that TCTP participates 
in this process (Lu et al, 2007; Qin et al., 2011). Indeed, loss of 
function of Arabidopsis AtTCTPl leads to delayed embryo devel- 
opment (Brioudes et al., 2010). AtTCTPl is expressed throughout 
plant tissues and developmental stages, particularly in meristem- 
atic and expanding cells; moreover, and by analogy to Drosophila 
TCTP, it might act as a mediator of TOR activity via interac- 
tion with Rho GTPases in a similar manner to non-plant systems 
(Berkowitz et al., 2008), although the involvement of TCTP in the 
TOR pathway is still debated (Wang et al, 2008). Other results 
suggest that AtTCTPl controls mitotic growth and cell division 
but not cell expansion in plants (Brioudes et al., 2010) and cell 
cycle progression (Nakkaew et al., 2010). 

There are variable numbers of TCTP genes in different organ- 
isms, while mammals harbor the highest number of TCTP-like 
sequence per genome (for example, human, mouse and rat 
each harbor 3 TCTP genes and a larger number of pseudo- 
genes), plants and fungi in general contain one or two TCTP 
genes (Hinojosa-Moya et al, 2008; Figure 1). It is possible that 



in organisms with multiple TCTP genes these are redundant; 
however, in some cases it is also possible that each performs 
non-overlapping or partially overlapping functions. 

Currently, several TCTP genes have been isolated and cloned 
from different plants species. However, there are few studies 
regarding their functional characterization. The following table 
presents information about TCTP in different plant species and 
its identified function (Table 1). These published data suggest 
that TCTP not only regulates general growth but also carries out 
functions that are specific for plants, such as auxin homeosta- 
sis, regulation of stomatal closure, defense response to bacteria, 
positive regulation of microtubule depolymerization, drought 
recovery, pollen tube growth, among others. Based on this wide 
range of functions and the different TCTP versions present in 
plants, we suggest a specialization of functions (or division of 
labor) of this multifunctional protein. In the present work we pro- 
pose that, based on an Arabidopsis [AtTCTPl ) and a Cucurbita 
maxima (pumpkin) TCTP (CmTCTP) genes, their probable func- 
tions and predicted three-dimensional structures, there are two 
groups of TCTP genes, an AtTCTPl -like clade and a CmTCTP- 
like clade, and their respective functions may be inferred from 
such grouping. We also found that only one chlorophyte, among 
those whose genomes have been sequenced, harboring a TCTP 
gene, with possible implications on land plant evolution. 

MATERIALS AND METHODS 
TCTP SEQUENCES 

Protein sequences used in this analysis were retrieved from 
Phytozome database (http://www.phytozome.net/). As a start- 
ing point, the virtual translation derived from the nucleotide 
sequence of Cucurbita maxima TCTP (CmTCTP; Genbank acces- 
sion number DQ304537) was obtained from the GenBank 
database (http://www.ncbi.nlm.nih.gov/nuccore/). Afterwards, a 
BLAST search was carried out using the corresponding CmTCTP 



Number of sequences by species 




FIGURE 1 I Number of TCTP sequences by plant species (Source: Phytozome). 
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Table 1 | List of TCTP genes present in different plant species and their functions. 



Species 


Gene 


Function 




Gene Banl< accession number 




Jatropha curcas (Physic Nut) 


JcTCTP EF091818 


Regulation of the endosperm development (Qin et al., 2011 ) 


Cucurbita maxima (Pumpkin) 


CmTCTP DQ304537 


Non-cell-autonomous function and long-distance movement of phloem 






proteins (Aoki et al., 2005; Hinojosa-Moya et al., 2006). 


Arabidopsis thaliana (Tlnale cress) 


AtTCTP NM112537 


Regulator of mitotic growth and cell cycle duration (Brioudes et al., 2010) 






Growth regulator, auxin homeostasis, root hair development and lateral 






root formation (Berkowitz et al., 2008) 






Drought tolerance, ABA-mediated stomatal closure, tubulin-, and calcium- 






binding properties (Kim et al., 2012) 


Fragaria ananassa (Strawberry) 


FaTCTP Z86091 


Fruit ripening? (Lopez and Franco, 2006) 


Elaeis guineensis (Oil palm) 


EgTCTP AAQ87663.1 


Cell growth and cell cycle progression (Nakkaew et al., 2010) 


Brassica oleracea (Cabbage) 


BoTCTP AF41 8663 


Growth regulation and defense response to cold, high temperature, and 






salinity stresses (Cao et al., 2010). 


Pharbitis nil (IVIorning glory) 


PnTCTP AB007759 


Photoperiodism, flowering? (Sage-Ono et al., 1998) 


Oriza sativa (Rice) 


OsTCTP BAA02151 


Response to Hg^+ stress (Wang et al., 2012) 


Nicotiana tabacum (Tobacco) 


NtTCTP API 07842 


Cell cycle progression (Brioudes et al., 2010) 


Triticum aestivum (Wheat) 


la 1 U 1 r AroUoy/U 


Powdery mildew resistance? (Li et al., 2010) 


Ricinus communis (Castor bean) 


RcTCTP RCOM_1433410 


Endosperm formation? (Lu et al., 2007) 


Vitis vinifera (Grape Vine) 


VvTCTP NC_012020 


Response to water deficit and salinity (Vincent et al., 2007) 


IVIanihot sculenta (Cassava) 


MsTCTP AAM55492 


Storage root formation (de Souza et al., 2004) 


Hevea brasiiiensis (Rubber tree) 


HbTCTP JN200814 


Response to ethrel, wounding, methyl jasmonate, low temperature, high 






salt, H2O2, and drought (Li et al., 2013) 


Pisum sativum (Common pea) 


PsTCTPAAB 19090 


Cell division (Woo and Hawes, 1997) 



protein sequence as query into the proteome Phytozome 
database. Based on this search, sequences representative of differ- 
ent plant species were selected for further study. Sequences that 
diverged from the consensus and were thus probably sequencing 
artifacts were not used for further analysis. 

IDENTITY AND SIMILARITY OF TCTP PROTEIN SEQUENCES 

The selected protein sequences of TCTP from different organ- 
isms were entered into MatGAT software version 2.01 (Matrix 
Global Alignment Tool) to calculate sequence similarity and iden- 
tity (http://bitincka.com/ledion/matgat/). The series of pairwise 
alignments (shown as distance matrix) were inferred using the 
BLOSUM62 scoring matrix and the following parameters: num- 
ber of first gap 10, gap extension 2. The analysis involved 59 amino 
acid sequences. Values were expressed as percent of similarity and 
identity between the TCTP protein sequences from different plant 
species compared with AtTCTPl (At3gl6640) from A. thaliana 
and CmTCTP from C. maxima. 

PHYLOGENETIC ANALYSIS 

The amino acid sequences chosen previously were aligned with 
the MUSCLE program (http://www.ebi.ac.uk/Tools/msa/muscle) 
using the profile method (Edgar, 2004). ProtTest program ver- 
sion 2.4 (Abascal et al, 2005) was used for the selection of the 
model of protein evolution that best fitted the set of sequences. 
Evolutionary analysis was conducted in PhyML 3.0 (Guindon 
et al, 2010) and the Molecular Evolutionary Genetics Analysis 
Program (MEGA) version 5 (Tamura et al., 2014). The evolu- 
tionary phylogenetic reconstruction (shown as dendrogram) was 
inferred using the LG-fG evolution method and the following 



parameters: number of substitution rate categories 4, gamma 
shape parameter 0.642, and 1000 replicate bootstrap search with 
heuristic search options. Branches with <60% bootstrap support 
were collapsed. The tree was rooted with the Cyanidioschyzon 
merolae TCTP sequence, which belongs to the red algae group. 
The analysis involved 59 amino acid sequences. 

PREDICTED TERTIARY STRUCTURE OF TCTP 

Full-length protein sequences were selected from the Phytozome 
database (http://phytozome.net) and refined through a com- 
parison with the NCBI database (http://www.ncbi.nlm.nih.gov). 
Amino acid sequences were submitted to the automated protein 
structure homology-modeling program server SWISS-MODEL 
(http://swissmodel.expasy.org/), this server builds a model for 
each protein target using as templates, homologous protein struc- 
tures which have been experimentally determined (Arnold et al., 
2006; Kiefer et al, 2009). Swiss PDB Viewer application (http:// 
www.expasy.org/spdbv/) was used for visualizing predictive 3D 
structures (Guex and Peitsch, 1997). 

PROMOTER ANALYSIS 

Upstream sequences (l.Okbp) from TCTP genes were retrieved 
from the Phytozome database (http://www.phytozome.net/). The 
list of genes and the species from which these upstream regions 
were obtained is listed in Table SI. These were classified accord- 
ing to their structural similarity to either AtTCTPl or CmTCTP 
proteins from Arabidopsis and pumpkin, respectively. Also, since 
several of these genes have not been characterized, their 5'UTRs 
have not been defined; in these cases, 1 kb sequences upstream of 
the start codon were analyzed. 
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Promoter analysis was performed using the MEME soft- 
ware (http://meme.sdsc.edu/meme/meme.html, Bailey et al., 
2006) with the following settings: one or more occurrence, per 
sequence, and a 10 base motif length. Based on sequence anal- 
ysis, using ALIGNACE, as well as visual inspection of the pro- 
moter sets, it was determined that CT/GA repeats were present 
more than once in the analyzed upstream sequences and, thus, 
the ZOOPS and OOPS models were not employed. As back- 
ground model the shuffled sequences were used and also ana- 
lyzed with MEME (http://www.bioinformatics.org/sms2/about. 
html). Motifs were graphically represented with the WEBLOGO 
program (http://weblogo.berkeley.edu/logo.cgi). As background 
models, the shuffled sequences were likewise analyzed using the 
MEME algorithm. 

RESULTS 

PLANTS HARBOR VARIABLE NUMBERS OF TCTP SEQUENCES 

A BLASTP search was carried out for each of the plant species 
for which its genome has been sequenced using the CmTCTP 
protein sequence as query, after which the corresponding TCTP 
sequences were retrieved. The list of 59 TCTP sequences from 
different plant species used in this study is shown (Table SI). 
Based on the TCTP amino acid sequences retrieved from 
Phytozome database, the species that have only a single TCTP 
gene are: Linum usitatissimum, Cucurbita maxima, Prunus per- 
sica, Malus domestica, Fragaria vesca, Citrus sinensis, Solanum 
tuberosum, Solanum lycopersicum. Sorghum bicolor, Zea mays, 
Setaria italica, Panicum virgatum, Oryza sativa, Brachypodium dis- 
tachyon, Selaginella moellendorffi, Coccomyxa subellipsoidea and 
Cyanidioschyzon merolae. Species that harbor 2 TCTP gene copies 
are: Ricinus communis, Medicago truncatula, Phaseolus vulgaris. 
Glycine max, Cucumis sativus, Cucumis melo, Arabidopsis thaliana, 
Arabidopsis lyrata, Brassica rapa, Thellungiella halophila, Citrus 
Clementina, Citrullus lanatus, Vitis vimfera, Mimulus guttatus and 
Physcomitrella patens. Finally, the species that harbors 3 TCTP 
genes are: Manihot esculenta, Populus trichocarpa, Gossypium rai- 
mondii and Aquilegia coerulea (Table 2). 

Pairwise BLAST is used to calculate similarity, but its limita- 
tions are that only two sequences may be analyzed at one time and 
percent of similarity/identity is based on local alignment instead 
of a global alignment. Matrix Global Alignment Tool (MatGAT) is 
a simple, easy to use similarity/identity matrix generator that cal- 
culates the similarity and identity between every pair of sequences 
in a given data set without requiring pre-alignment of the data. 
The program performs a series of pairwise alignments using the 
Myers and Miller global alignment algorithm, calculates sim- 
ilarity and identity, and then places the results in a distance 
matrix (Campanella et al., 2003). Sequences were aligned with 
Arabidopsis At TCTP 1 and CmTCTP as type sequences, because 
biological function of the first one is better known, while the 
latter, albeit less well studied, appears to have a different func- 
tion according to results in our group (see below; Hinojosa-Moya 
et al, 2013; Toscano-Morales et al., submitted). 

Sequence comparison at the amino acid level of TCTPs from 
different species showed a similarity in the range of 80.4-98.2 and 
82.1-94% compared with At3gl6640 (AtTCTPl) and CmTCTP 
respectively (Table S2). On the other hand, the percent of 



Table 2 j Structural classification of TCTP from plants. 



Organism 

C. subellipsoidea 
C. meroleae 
P. patens 

S. moellendorffi 
O. sativa 

B. distachyon 
S. bicolor 

Z. mays 
S. italica 

A. coerulea 

P. virgatum 
S. tuberosum 

S. lycopersicum 
M. guttatus 

V. vinifera 

C. sinensis 

C. Clementina 
G. raimondii 

T. halophila 

B. rapa 
A. lyrata 
G. max 

P. vulgaris 



AtTCTPI-like 

C-169_65285 
CMQ113C 



179722 

Os11g43900 

Bradi4g10920 

XP_002453140 

GRMZM2G108474_T01 

Si026772m 

Aquca_003_00740 



CmTCTP-like 



XP_001757363; 
XP_001 758666 



SolycOl g099780.2 



1.1g030941m 

Ciclev10006071m 

Gorai.005G060700 

BAJ33998 
Bra022172 
XP_002885160 



Aquca_017_00176; 
Aquca_035_00202 

Pavirv00039226m 

PGSC0003DMT 
400063579 

Migut.G00151; 
Migut.N02086 

GSVIVT01 01 7723001; 
GSVIVT01 031135001 

Ciclev10002699m 

Gorai.007G300300; 
Gorai.013G126000 

10022380m 

Bra001637; Bra021187 

XP_002884515 

Glyma10g29240; 
Glyma09g04950 

Phvul.007G197200; 
Phvul.009G248700 



M. truncatula 

F. vesca 

M. domestica 


Medtr1g083350; 
Medtr6g071090 
mRNA06814.1-v1.0- 
hybrid 


MDP0000164046 


P. persica 


ppa009639m 




C. sativus 


Cucsa. 253020 


Cucsa. 181820 


C. melo 


Melo3c006670p1 


Melo3C015297P1 


C. lanatus 


Cla021747 


Cla005200 


C. maxima 


N.D. 


ABC02401 


P. trichocarpa 


Potri.005G024800; 
Potri.008G226500 


Potri.010G013400 


L. usitatissimum 


Lus10033959 




R. communis 




29726. m004052; 



M. esculenta 



cassava4.1_025245m 



30128. m008835 

cassava4.1_017738m; 
cassava4.1 017756m 



iContinued) 
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Table 2 | Continued 

Organism AtTCTP1-lil<e CmTCTP-lil<e 

C. rubella EOA33129.1 EOA31592.1 

C. papaya evm. model. supercontig. 

1597.1; 

evm. model. supercontig. 
327.3 



Modeling of predictive 3D protein structure for each plant TCTP was carried out 
(Figure S4) and structure comparison was assessed to classify each plant TCTP 
based on their predictive structural relationship to either Arabidopsis thaliana 
AtTCTPl or the Cucurbita maxima CmTCTP 

identities ranged from 70.2-95.2 to 71.2-88.1 with the refer- 
ence AtTCTPl and CmTCTP sequences respectively. These ranges 
did not include the percent of similarities/identities with the 
microalgae C. subellipsoidea and the red algae C. merolae. 

PHYLOGENY OF PLANT TCTP PROTEIN SEQUENCES CANNOT BE 
RESOLVED SATISFACTORILY 

Protein sequences used in this analysis were retrieved from the 
Phytozome database. The amino acid sequences chosen were 
aligned with the MUSCLE program (Figure SI). A phylogeny 
of the aforementioned sequences was generated by MEGA 5.0 
software by using the LG+G method with 1000 bootstrap replica- 
tions. Phylogenetic analysis based on the TCTP protein sequence 
from different organisms groups may be informative with respect 
to the evolution of TCTP per se. But this also depends on which 
outgroup was used to root the tree. If this consists of a pro- 
tein with a similar function, the tree may reflect the functional 
similarities between these proteins. On the other hand, an out- 
group consisting of a sequence derived from an early branching 
event may yield more information on the phylogenetic relation- 
ship between the taxa harboring this protein (Hinojosa-Moya 
et al., 2008). For this analysis, TCTP sequence from the red algae 
C. merolae was used as outgroup. However, the resulting tree 
could not be resolved satisfactorily (Figure S2). 

PREDICTIVE PROTEIN STRUCTURE OF PLANT TCTPs AS A PARAMETER 
TO ESTABLISH POTENTIAL FUNCTIONAL DIFFERENCES 

Since phylogenetic analysis did not provide sufficient information 
to suggest functional differences between plant TCTP sequences 
could be inferred; thus, other criteria were used to help estab- 
lish such differences, if any. We had previously reported that 
while TCTP structures are highly conserved, there are subtle dif- 
ferences among some members of this family. Indeed, TCTP 
from Plasmodium falciparum and P. knowlesi harbor an extra 
a-helix close to the N terminus, not found in other TCTPs 
(Hinojosa-Moya et al., 2008); this causes a structural distortion 
in the pocket region that may be involved in the interaction with 
GTPases. We have recently found that such structural differences 
are reflected in the capacity of human and P. falciparum TCTP to 
induce B cell proliferation, as well as its incorporation into these 
cells (Calderon-Perez et al., 2014). Thus, we assumed that plant 
TCTPs could also show structural differences from which func- 
tional specialization could be deduced. Therefore, TCTP amino 



acid sequences from each plant species whose genome has been 
sequenced were obtained from the Phytozome database, followed 
by direct comparison to sequences from other databases in order 
to avoid ESTs or repeated sequences. A total of 59 amino acid 
sequences were submitted to the automated protein structure 
homology-modeling server SWISS-MODEL (http://swissmodel. 
expasy.org/) that generated a homology model for each sequence. 
The respective model was visualized and modified to generate the 
3D-models using the Swiss-Pdviewer 4.10 application. 

The Arabidopsis genome harbors two TCTP genes, AtTCTPl 
and AtTCTPl (At3gl6640 and At3g05540, respectively). The 
important role of the former in general growth and development 
has been described before. We have reported that the pumpkin 
[Cucurbita maxima) TCTP (CmTCTP) induces cell proliferation 
and regeneration, in conjunction with A. rhizogenes rol genes, and 
is phloem-mobUe; thus, it can be considered that it is not a func- 
tional homolog of AtTCTPl (Hinojosa-Moya et al., 2013). These 
results suggest that all TCTPs fall within these two broad groups, 
one of them related to AtTCTPl, while the other one more related 
to CmTCTP 

AtTCTPl and CmTCTP were used as reference structures to 
determine whether this "division of labor" can be extrapolated 
to other plant species. It may be assumed that such functional 
specialization occurs in plants harboring more than one TCTP 
gene, while in those harboring a single TCTP gene this would per- 
form the combined functions of AtTCTPl and CmTCTP. Finally 
this specialization would be expected only in land plants, and 
especially in vascular plants. 

The 3D-predictive structures of all TCTPs were generated 
using Swiss Model; the templates used to obtain such struc- 
tures are listed in Table S3. These structures presented a quite 
similar conformation. However, we found the "pocket" region 
near the center of the protein (and which may be the GTPase- 
binding domain, at least in fungi and animals) and its flex- 
ible loop as the most divergent regions among plant TCTPs. 
Interestingly, AtTCTPl and CmTCTP meet this criterion of diver- 
gence. Indeed, the latter is predicted to be structurally similar to a 
group of TCTPs including StTCTP and FvTCTP while AtTCTPl 
is more related to other TCTP versions such as Oryza sativa 
TCTP (OsTCTP) (Figure 2). The predictive structures of diverse 
TCTPs support the notion that these proteins fall into two groups, 
AtTCTPl-like and CmTCTP-like (Table 2; Figures S3, S4), and 
based on these their possible functions can be inferred. 

Intriguingly, we found no TCTP genes in several chlorophytes, 
e.g., Ostreococcus lucimarinus, Micromonas pusilla, Volvox carteri, 
and Chlamydomonas reinhardtii, with the exception of Coccomyxa 
subellipsoidea, which harbors one TCTP gene. This suggests that 
other proteins perform the roles played by TCTP in most plants 
are regulated through alternative routes. In contrast, the red algae 
Cyanidioschyzon meroleae harbors one TCTP gene. 

The moss Physcomitrella patens harbors two TCTP genes, 
which encode a protein structurally similar to AtTCTPl and 
the other resembling CmTCTP. In contrast, the tracheo- 
phyte Selaginella moellendorffi and several monocots (such as 
Brachypodium dystachion, Oryza sativa. Sorghum bicolor, Zea 
mays, Setaria italic, and Panicum virgatum) have only one gene 
coding for TCTP (all similar to AtTCTPl). Several eudicots 
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Arabidopsis AtTCTPl 



C. maxima CmTCTP 




0. sativa TCTP 



S. tuberosum TCTP 




FIGURE 2 I structural comparison of Arabidopsis TCTPs. The "pocket " 
structure (highlighted in dashed red lines) and the flexible loop 
orientation indicate probable structural differences between these two 
TCTPs. In the bottom there are two examples of the structural 



classification made for all Plant TCTPs, retrieved from Phytozome 
database (http://phytozome.net). O. sativa TCTP (OsTCTP) is structurally 
related to AtTCTPl while S. tuberosum (StTCTP) is predicted to be 
structurally similar to CmTCTP 



harbor only one gene, but others contain two or more TCTP genes 
(Table 2). In the cases in which species have only one TCTP gene, 
this is usually AtTCTPl -like, as mentioned previously; but in 
some cases it resembles more CmTCTP (as in G. max, P. vulgaris, 
and M. domestica). Since the complete genome of pumpkin is not 
yet available, it is possible that it harbors an additional TCTP 
gene, which on a speculative note would be AtTCTPl -like pro- 
tein. Indeed, considering that C. melo and C. sativus also harbor 
two copies of a TCTP gene, an Af TCTPi -like and a CmTCrP-like 
gene in each case, this is likely. 

AtTCTPI-LKE AND CmTCTP-UKE UPSTREAM REGIONS SHOW 
DIFFERENT MOTIF FREQUENCIES 

Bioinformatic analyses of the upstream sequences from TCTP 
genes in plants identified common motifs in their sequences. 
Both sets of genes display overrepresented CT/GA motifs (motif 
1) with an extremely low E value, indicating that these sets 
are expressed in vascular and/or rapidly growing tissues (Ruiz- 
Medrano et al., 2011; Figure 3). However, since the E value for 
the CmTCTP-like gene promoters is 12 orders of magnitude lower 



than for the corresponding AtTCTPl -like gene promoters, it is 
possible that there are differences in the expression levels or in 
their tissue-specific expression. On the other hand, the AtTCTPl- 
like promoters motif 2 is quite different from the corresponding 
motif 2 from the CmTCTP-like promoters, the former being an 
AC-rich element with a lower E value. Both gene promoters motif 
3 sets is an A-rich element, quite common in upstream regula- 
tory sequences. The shuffled sequences showed overrepresented 
motifs with very high (i.e., non-significant) _E-values (10^-10^). 
These results suggest that the expression profiles of AtTCTPl -like 
and CmTCTP-like gene promoters may not overlap completely. 

DISCUSSION 

The known TCTP interactome indicates a wide variety of activ- 
ities such as microtubule organization, inhibition of cell death, 
translation and ribosome function, DNA processing and repair, 
and modulation of the immune response, among several others 
(Nagano-Ito and Ichikawa, 2012; Amson et al., 2013). This trans- 
lates to roles in general growth and developmental regulation, 
which in plants are less well known. Plant TCTPs show identities 
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FIGURE 3 I MEME analyses of the TCTP promoters. Consensus sequences identified in the promoters regions for botli AtTCTPI-like and CmTCTP-like. Only 
the three motifs having the lowest E-values are shown. 



at the amino acid in the range of 70.2-95.2% and similarities 
of 80.4-98.2%, which support the notion of highly conserved 
functions for these proteins (See Table S2). 

However, because of all these activities it is not clear whether 
TCTP is a multifunctional protein or if several closely related 
sequences have subtly different biochemical properties; this is 
especially relevant in cases where there is more than one gene 
per genome. For example, in humans, there are four TCTP genes, 
one of them corresponding to the histamine release factor (HRF), 
which is secreted. It is not known if this is the only TCTP that 
is secreted. Similarly, it is not clear if these genes display a com- 
mon expression pattern. It is reasonable to assume that there is 
functional specialization among them. Another general assump- 
tion is that these proteins are structurally similar; however, our 
group has found that in some particular cases subtle structural 
differences underlie functional variations in TCTP. Indeed, the 
P. falciparum and P. knowlesi TCTP harbor an additional a-helix 
in the N-terminal domain (amino acids 22-30), which introduces 



an overall distortion in its structure, particularly in the central 
pocket region which may be involved in the recognition and 
binding of GTPases such as guanine-nucleotide exchange factors 
(Hsu et al., 2007; Hinojosa-Moya et al., 2008; Calderon-Perez 
et al., 2014). It must be mentioned that the structures of the 
P. falciparum and P. knowlesi TCTPs have been experimentally 
determined (Vedadi et al, 2007; Eichhorn et al, 2013). Although 
this last activity has not been shown in vitro, it is clear that this 
domain is important for TCTP function. The P. falciparum TCTP 
is secreted into its host bloodstream, and can reach high titers, 
albeit its role in pathogenicity is not clear (MacDonald et al, 2001; 
Pelleau et al., 2012). Since HRF induces B cell proliferation, we 
suggested that the P. falciparum TCTP, which also induces his- 
tamine release (MacDonald et al., 2001), may mimic such activity, 
although not as efficiently as its host counterpart (Hinojosa-Moya 
et al, 2008). We have shown that P. falciparum TCTP induces 
B cell proliferation at low levels, but shows a different localiza- 
tion pattern when internalized into B cells (Calderon-Perez et al.. 
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2014), indicating that its structural distortion leads to a novel 
activity (possibly interfering with the host immune response). 
Interestingly, the structural variation of Plasmodium TCTPs was 
also observed in the CmTCTP-like proteins from plants, suggest- 
ing also functional variation relative to AtTCTPl -like proteins. 
Indeed, the pocket region, which is formed almost invariably by 
glutamic acid residues at positions 12 and 134 and leucine at 74 
(using the numbering of residues from the Schizosaccharomyces 
pombe TCTP as reference) has an "open" conformation in most 
non-plant TCTPs and AtTCTPl -like proteins, while it displays a 
"closed" conformation in P. falciparum and P. knowlesi TCTPs, 
as well as in CmTCTP-like proteins (Figure 4). Interestingly, 
there are a few extant non-plant TCTP sequences that have 
substitutions in one of these three conserved residues, but no 
such substitutions were found in plant sequences. As with the 
Plasmodium genus, the AtTCTPl -like and CmTCTP-like proteins 
may display different biological activities, at least in the case of 
the type members of these groups in Arabidopsis and in pumpkin 
(Hinojosa-Moya et al., 2013; Toscano-Morales et al., submitted). 



Indeed, CmTCTP (and, intriguingly AtTCTPl), but noi AtTCTPl 
are capable of inducing whole-plant regeneration when harbored 
by A. rhizogenes (although not by A. tumefaciens, suggesting the 
involvement of rol genes and/or auxins in this process). It has 
been shown that AtTCTPl has a central role in the regulation 
of cell proliferation, although not postmitotic growth (Berkowitz 
et al, 2008; Brioudes et al., 2010). This could be extrapolated to all 
AtTCTPl -like proteins, while CmTCTP-like proteins could have 
a more direct role in developmental reprogramming. 

The phylogenetic relationship of plant TCTPs showed a large 
polytomy; thus, it could not be resolved satisfactorily. According 
to a previous analysis, in the cladogram of plant and non-plant 
TCTPs, two polytomies were likewise observed within the plant 
and vertebrate clades. In the case of plants, there were two groups, 
one of which being highly structured and reflecting the known 
phylogeny of angiosperms. However, the other group, whose phy- 
logeny was not resolved, included sequences from both dicots 
and monocots in a polytomy (Hinojosa-Moya et al., 2008). It 
is generally accepted that phytogenies based on a single gene 



S, pombe (SpTCTP) 



P. falciparum (PfTCTP) 




FIGURE 4 I Structural comparison between Arabidopsis and 
Plasmodium falciparum TCTPs. The "pocket" structure (highlighted in 
dashed red lines) and the flexible loop orientation indicate probable 
structural differences between these TCTPs. As can be observed, the 



pocket region of AtTCTPl and S. pombe TCTP (SpTCTP) have a "closed" 
conformation, while in CmTCTP and R falciparum TCTP (PfTCTP) have an 
"open" conformation, that may cause altered interaction with its potentia 
targets, or with different factors. 
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or protein sequence are prone to artifacts and tend to be inac- 
curate due to uneven rates of substitution. As shown in this 
and previous studies, this is the case for TCTP in plants, at 
least with the sequences used for the phylogenetic analysis. As 
substitution rates vary among members of the same lineage, 
as is suggested in plants, the data reinforce possible differential 
selective pressure on TCTP sequences. We expected that using 
a TCTP sequence from a phylogenetically distant organism (a 
red alga, C. merolae) as outgroup to root the tree could reflect 
the functional similarities between these proteins; nevertheless, 
phylogenetic relationships could not be established either. Low 
bootstrap values underscore the high similarity of plant TCTP 
sequences. 

EMERGENCE OF CmTCTP mL GENES AND DIFFERENT ROLE OF THE 
CORRESPONDING PROTEINS 

In general terms, it appears that CmTCTP-like genes could arise 
roughly during the emergence of land plants through gene dupli- 
cation of an ancestral AfTCrPi -like gene (which is deduced from 
the nature of the extant TCTP sequence from C. subellipsoidea). 
The absence of this gene in most chlorophyte genomes could 
be due to technical problems, explained if it is located close to 
centromeres or telomeres, which are regions notoriously difficult 
to sequence. However, if these genes are really not present, then 
other proteins must perform the function carried out by TCTP, 
the nature of which is not clear. The loss of TCTP genes in these 
lineages cannot be discarded either, but a more thorough analy- 
sis of the regions flanking the TCTP gene in C. subellipsoidea and 
the homologous regions in other chlorophytes may help elucidate 
this question. In any case, the plant TCTP interactome could help 
elucidate the pathways regulated by the proteins that replaced 
them in these species, such as regulation of cell cycle duration 
and proliferation (Brioudes et al., 2010). In the case of AtTCTP2, 
it arose recently, before the divergence of the Ambidopsis genus; 
indeed, the position of the A. lyrata AtTCTP2 homolog in the 
genome is equivalent to its Arabidopsis counterpart. At TCTP 1 
and AtTCTP2 show an identity superior to 80%; the most visi- 
ble difference is a 1 3 aa deletion of AtTCTP2 relative to AtTCTP 1 . 
No other TCTP harbors such deletion, except Giardia lamblia 
TCTP. Interestingly, the deletion falls in the same region (the 
flexible loop) as in AtTCTP2; this loop may be an hypervariable 
region that in all predicted TCTP structures falls outside of the 
domains known to be involved in some of its activities (Ca+^ and 
microtubule binding for example). However, it appears that dis- 
tortions in such domain may alter the overall conformation of 
these proteins and hence their function. 

The analysis of the predictive 3D conformation shows that it is 
feasible to infer that there may be a division of the labor among 
plants harboring multiple TCTP genes, and moreover that in 
some cases the unique TCTP version could perform several func- 
tions. Clearly, experimental evidence must be obtained in order to 
test this hypothesis; however, this can be done in a straightforward 
manner; if this is correct, all AfTCTPi -like genes will be unable to 
regenerate tobacco plants via A. rhizogenes mediated inoculation 
of leaf explants, in contrast to CmrCTP-like genes (Hinojosa- 
Moya et al., 2013; Toscano-Morales et al., submitted). Given that 
the same underlying mechanisms of cell proliferation operate in 



plants and other eukaryotes, this simple assay could be used to 
determine whether a TCTP gene is involved in cell proliferation 
or cell (developmental) reprogramming. 

CONCLUDING REMARKS 

The exact mechanism through which TCTP regulates plant 
growth and development is not clear. The fact that there may 
be at least two different groups of TCTP genes in plants, one 
more involved in the regulation of cell proliferation, and the 
other one in cell reprogramming, was inferred from the activity 
of two proposed representative members of this protein family in 
plants, AtTCTP 1 and CmTCTP from Arabidopsis and C. max- 
ima, respectively. On the other hand, such differential activity 
correlates with subtle structural variations of the predicted three- 
dimensional structures of these proteins. Interestingly, these 
structural variations are also present in other eukaryotes, which 
may also correlate with functional variation. We propose that 
most plant TCTPs fall within one of these groups, and that a sim- 
ple assay, i.e., regeneration of tobacco when harbored by A. rhizo- 
genes, could help determine a role in cell proliferation alone or cell 
reprogramming. 
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abstract 

Figure SI | Protein sequence alignment. The amino acid sequernce 
aligrnment was performed using the MUSCLE software on plant 
sequences included in this work. 

Figure S2 | Phylogenetic analysis of the TCTP in plants. C. maxima 
translationally controlled tumor protein (TCTP) sequence (GenBank 
Accession No. ABC02401) was retrieved form NCBI (http://www.ncbi.nlm. 
nih.gov) and used as query. Protein homologs were identified by BLAST 
search against the phytozome database (http://phytozome.net). Full-length 
protein sequences were aligned with MUSCLE (http://www.ebi.ac.uk/ 
Tools/msa/muscle). ProtTest program (PROTTEST 2.4, Abascal et al., 
2005) was used for the selection of the model of protein evolution that 
best fitted the set of sequences. Evolutionary analysis was conducted in 
PhyML 3.0 (Guindon et al., 2010) and MEGA5. The evolutionary 
phylogenetic reconstruction (shown as dendrogram) was inferred using 
the LG-hG evolution method and the following parameters: number of 
substitution rate categories 4, gamma shape parameter 0.642. Bootstrap 
values higher than 60% are shown (1000 replicates). The analysis involved 
55 amino acid sequences. 

Figure S3 | Predictive 3D structure comparison of AtTCTP-like proteins. 

Full-length protein sequences were selected from phytozome database 
(http://phytozome.net) and refined through a comparison with NCBI 
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database (http://www.ncbi.nlm.tnih.gov). Amino acid sequences were 
submitted to the automated protein structure homology-modeling 
program server SWISS-MODEL (http://swissmodel.expasy.org/), this 
server builds a model for each protein target using as templates 
homologous protein structures which have been experimentally proved 
(Arnold et al., 2006; Kiefer et al., 2009). Swiss PDB Viewer application 
(http://www.expasy.org/spdbv/) was used for visualizing predictive 3D 
structures (Guex and Peitsch, 1997). 

Figure S4 | Predictive 3D structure comparison of CmTCTP-lil<e proteins. 

Full-length protein sequences were selected from phytozome database 
(http://phytozome.net). Amino acid sequences were submitted to the 
automated protein structure homology-modeling program server 
SWISS-MODEL. Swiss PDB Viewer application (http://www.expasy.org/ 
spdbv/) was used for visualizing predictive 3D structures (Guex and 
Peitsch, 1997). 
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